Runtime characterisation of irregular accesses applied to parallelisation of irregular reductions
Authors
Abstract
Irregular reduction operations are the core of many large scientific and engineering applications. There are, in the literature, different methods to solve these operations in parallel. In this paper we discuss a new technique which improves performance significantly, both in terms of execution time and memory overhead. These improvements are achieved in the preprocessing as well as in the resulting parallel code. Our proposal is based on the use of the Irregular Access Region Descriptor (IARD). This data structure is a compact characterisation of indirectly accessed arrays that can be used for the efficient parallelisation of a wide spectrum of irregular codes. In this paper we present its application to parallelise irregular reduction operations on a shared memory machine. We compare our proposal with two other competitive solutions.
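To make the term concrete, here is a minimal sketch of an irregular reduction and of one classical way to parallelise it, in which each worker accumulates into a private copy of the reduction array. This is not the IARD-based technique of the paper, whose details the abstract does not give. The function names, the thread pool and the chunking scheme are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def reduce_sequential(size, index, values):
    """Reference loop: result[index[i]] += values[i], with index known only at run time."""
    result = [0.0] * size
    for i, v in zip(index, values):
        result[i] += v
    return result


def reduce_replicated(size, index, values, workers=4):
    """Illustrative replicated-array scheme (hypothetical helper, not the paper's method).

    Each worker processes a contiguous block of iterations and writes only to
    its own private copy, so no two workers update the same memory location.
    """
    n = len(index)
    chunk = (n + workers - 1) // workers  # ceiling division

    def partial(w):
        private = [0.0] * size
        lo, hi = w * chunk, min((w + 1) * chunk, n)
        for k in range(lo, hi):
            private[index[k]] += values[k]
        return private

    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(partial, range(workers)))

    # Combine step: extra memory and work grow with workers * size.
    return [sum(column) for column in zip(*partials)]
```

The private copies and the final combine step are the kind of memory and time overhead that the abstract says the proposed technique reduces.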
Related papers
Memory System Support for Irregular Applications
Because irregular applications have unpredictable memory access patterns, their performance is dominated by memory behavior. The Impulse configurable memory controller will enable significant performance improvements for irregular applications, because it can be configured to optimize memory accesses on an application-by-application basis. In this paper we describe the optimizations that the Impuls...
Processing irregular codes containing arrays with multi-dimensional distributions by the PREPARE HPF compiler
The automatic parallelization of irregular codes for massively parallel systems is a challenging problem of growing importance. This paper describes techniques for processing irregular codes that are being implemented in the Prepare HPF compiler. Our work makes two contributions: (1) it constructs a library called Parti+ that is used for the runtime analysis of codes with runtime data dependent...
Evaluation of Compiler and Runtime Library Approaches for Supporting Parallel Regular Applications
Important applications including those in computational chemistry, computational fluid dynamics, structural analysis and sparse matrix applications usually consist of a mixture of regular and irregular accesses. While current state-of-the-art run-time library support for such applications handles the irregular accesses reasonably well, the efficacy of the optimizations at run-time for the regul...
Improving Compiler and Run-Time Support for Irregular Reductions Using Local Writes
Current compilers for distributed-memory multiprocessors parallelize irregular reductions either by generating calls to sophisticated run-time systems (CHAOS) or by relying on replicated buffers and the shared-memory interface supported by software DSMs (TreadMarks). We introduce LocalWrite, a new technique for parallelizing irregular reductions based on the owner-computes rule. It eliminates th...
Optimizing Irregular HPF Applications using Halos
This paper presents language features for High Performance Fortran (HPF) to specify non-local access patterns of distributed arrays, called halos, and to control the communication associated with these non-local accesses. Using these features, crucial optimization techniques required for an efficient parallelization of irregular applications may be applied. The information provided by halos is uti...
Journal title:
- IJCSE
Volume 1, Issue
Pages -
Publication date: 2005